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Abstract 

(N , 

Using the representation introduced in our another paper pQ, the well-known 
Quantum Prisoner's Dilemma proposed in [2], is reexpressed and calculated. By 
this example and the works in |J on classical games and Quantum Penny Flip 
iy-j ' game, which first proposed in [2], we show that our new representation can be a 

f^) , general framework for games originally in different forms. 

Key Words: Quantum Game Theory, Prisoner's Dilemma 
Pacs: 02.50.Le, 03.67.-a 



3 



Introduction — Recently, we proposed a new mathematical represent ation^Q for 



Classical and Quantum Game Theory. It has been shown than iV-player classical 



games, which are traditionally defined by N single-player strategy sets and (0, N)- 
tensor payoff functions, can be equivalently reexpressed in the new representation, by 
a system strategy space and N (1, l)-tensor payoff functions. In the same paper, the 
^ ■ well-known Quantum Penny Flip gamejH] has also been rewritten by the new language. 

In another paper [3], we apply the new representation onto Battle of the Sexes and get 
some interesting results such as entangled strategy equilibrium state. Although our 
new representation is defined as an abstract form, which is believed be able to describe 
any specific games, it seems still necessary to discuss more famous games as examples 
by this new language. So in this paper, we try to describe in the new representation a 
well-known game proposed in [2], the Quantum Prisoner's Dilemma. 

The original Quantum Prisoner's Dilemma — First, we follow the definition in 2 , 
but reexpress it in density matrix form instead of the original state vector form, and 
give the manipulative definition proposed in [H], 

r*°= Ug€B^n® rf >A{i*}Y (i) 

A two-particle quantum system is used as the quantum object in the game, which 
has the Hilbert space formed by base vectors \UU) ,\UD) , \DU) , \DD). Here \UU) 
represents the state that particle 1 and particle 2 stay on \U). We also suppose they 



I 



are distinguishable, named 1 and 2 respectively. The initial state of the quantum object 
is 
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p% = \UU) (UU\ 









(2) 



A typical strategy players can use is 



e 1 * cos 0/2 sin 9/2 
-sin 6/2 e-** cob 9/2 



(3) 



So player l's strategy space is the above unitary operator acting on particle 1, and 
similarly for player 2. The payoff value is determined by 



E l = Tr (& (u 1 ® U 2 } p ((J 1 ® U 2 ^ , 



(4) 



in which, if defined in the base vectors above, the payoff scale matrix P % is 
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So mapping C ( U 1 , U 2 ) = U 1 (g> C/ 2 . Classical pure strategies are 



N c 



1 
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and F c 
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(6) 



If players can only use classical strategies, we can check that the payoff from equ@ 
equal to the classical payoff defined as 
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For example, we check the situation when both the two players choose iV c . 7V C acting 
on \U) gives \U), so the end state of the quantum object is still \UU). So the first 
elements of G l,c and G 2 '° are r and r. 

Quantum Prisoner's Dilemma in the new representation — Now we try to derive 
the abstract formPIS], 



(8) 



The central idea of our new representation is to find a set of base vectors for strategy, 
and to defined inner product between them so as to form them as a Hilbert space. And 
then redefined payoff function as a mapping from the system strategy space to real 
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number. Here, we have four good and natural base strategies. Besides the two classical 
pure strategies in equ©, we still have 



N q 



1 
-1 



and F q 



-% 

1 



(9) 



which we named as quantum base strategies. A general quantum strategy in equ© 
can be expanded as 



U (6, 4>) = cos - cos 4>N C + % cos - sin 4>N q + i sin -F q '-. 

A more general operator can be 

U = £ . N c + x ■ F c + y • F q + z • N q , (£, x, y, z G C) . 
Or if we require s is unitary, in a set of independent parameters 0], 



(10) 



(11) 



U = cos — cos 

2 2 



-i sin — sin 

2 2 



b — ism - cos 
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2 



a + /3 



The inner product is defined as 



s, s 



Tr ( s^s' 



- — -F q -i cos 1 sin ^^Af 9 . 
2 2 2 

(12) 



(13) 



Then (N c ,F c ,N q ,F q ) are orthogonal and normalized. Later on we denote them as 
base vectors such as \N C ). A system strategy space is the direct product space of the 
two players, so it has 16 base vectors such as \N C , N c ). A state in the system strategy 
space can be 

\S) = \s\s 2 ). (14) 
Now we try to define (1, l)-tensor payoff matrix H l so that 



E l (S) = (S\ H % \S),VS. 



(15) 



In a general procedure has been proposed, that first to define its elements on a 
specific set of base vectors, then prove it can be used for any states. Now elements of 
the payoff matrix is defined 



H. 



a/3 



{a\H* 



H l \f3\f3 2 ) = Tr (p* ((3 1 (3 2 ) p (a 1 ® a 2 ) ] 



fa 1 , a 21 « 2 



(16) 



in which a l ,f3 l are anyone of the predefined base vectors (N c ,F c ,N q ,F q ). Before we 
calculate all the values of the elements, we need to prove the definition in equ(|16jl 
guarantee equ^SJ) is valid for any strategy. 

Theorem Suppose \S) = \s 1 ,s 2 }, Vs 1 ,^ 2 , for the payoff matrix H % defined in equ ()16|) . 
prove that E i (S) = Tr (p i (s 1 ® s 2 ) p (s 1 ® s 2 ) ] ) equals (S\ fl*|5) = (s 1 ^ 2 ) W \s l ,s 2 
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Proof If s^s 2 are the base vectors, this is just the definition of H i . So it's obvious. 



We claim that (s 1 ® 



[s 1 ) t 



s ^ + s i ® 4 + 4 ® s 



= • 03 s~ ana 

The proof of a general 



strategy S will need all of these relations, which are easy to check. Now we suppose 
s * = E^JJm)- Then 



Tr P 



J 1 <8> s 2 ) po (s 1 ® s 2 )* 



»2\ t 



Tr | pi ^2 x i x 2^ , 



^po E^ 



xjx 2 ^ (8) 77' 



and 



(S\ H* \S) 
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E^,iy,^,jj x li x i x \ x q (£) ?7 l-^l A* 



So they are equal, and we get eau H15|) . The payoff matrix H , H 2 in the base vectors 
(N c ,F c ,Ni,Fi) are 
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(17) 
(18) 
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For classical game, the base vectors are only N c , F c , then the sub-matrix are 
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(19) 



They are equivalent with the payoff matrix directly reexpressed into our new represen- 
tation from G 1,c , G 2,c . 

Density matrix form of the game — For a quantum system state, eau lfT5)l can be 
used to calculated the payoff value. But a classical mixture strategy with probability 
p l nc on N c and p l j c on F c is impossible to rewritten as the vector form as equ (|ll|) . In 
order to compare quantum strategy with classical strategy, we have to define a more 
general strategy form. In ill, a density matrix form is used, such as 

P S '° = {p 1 uc\N c ) (N c \ +p 1 nc \N c ) (N c \) (p 2 nc \N c ) (N c \ +pl c \N c ) (N c \) . 

In fact, this density matrix form can be applied onto both classical and quantum 
strategies. So quantum mixture strategy is permitted to use by a quantum player. 
Then the payoff value equ l)15|) turns into a density matrix form, 



E i = Tr . 
Now our classical and quantum Prisoner's Dilemma is redefined as 



(20) 



r<?= (S 1 - 9 (g> 5 2 -9, (H 1 'i,H 2 'i)) and r c = (S 1 ^ <g> S 2 ' c , (H 1 * , H 2 > c )) . (21) 

The classical game is defined in a subspace of the quantum game, and the classical 
payoff matrix is the sub-matrix on the subspace. 

Equilibrium state of the game — Now we have shown that our language can be 
used to discuss this game. Although calculation of NE and a general algorithm is 
not the main topic of this paper, finding some solutions and comparing them with the 
solutions given in their original frameworks is quite attractive. In Nash Equilibrium 
is redefined and a Global Equilibrium State (GES) is proposed. And in [2j, a Pareto 
optimal state ((/ (0, vr/2) , U (0, 7r/2)J is found. Now we try to check if there is a GES, if 
not, if there is some other state which can be used to beat the Parato optimal strategy. 
A Nash Equilibrium State pf q is defined as 



& {p s eq ) >E* (Tr* (p^).p%VW 



(22) 



in which Tr 1 (•) means to do the trace in player i's strategy space. If system state is a 
direct product of all single-player states, 



■eq 



(23) 
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then definition in eau ()22j) is equivalent with traditional NE, 



E l [p 8 eq ) >E* (p 1 eq ---p i ---p? q ),Vi,Vp i 



(24) 



A special case of the first definition is 

& {p s eq J>E*(p s ),Vp s ,Vi. 



(25) 



Although it is not always possible to find such a state pf 9im , if the game has one such 
state then it is a dominant strategy, so we named it GES jKj . The reduced payoff matrix 
H* R is the reduced matrix of H l when all other players' strategies are fixed, 



m 



Tr_ i {p l ...p i - 1 p i + l ...p N W) 1 



(26) 



where Tr_j (•) means to do the trace in the space except player i's space. 

If both H 1 and H 2 have a common eigenvector, which has the maximum eigenvalue 
in both the two payoff matrix, then this state is a GES. It's probably an entangled 
strategy state like the one of the game in [3]. Here we check if such state exists in 
this game. Both H 1 and H 2 have eigenvalues 4s, 4r, Ap, At and other 12 zeros. The 

12 12 12 1 

corresponding eigenvector \At) ^ \At) and |4s) ^ 1 4s) , but |4r) = |4r) and \ Ap) = 
\Ap) 2 . When t > r > p > s, there is no GES, but \S m ) = \Ar) 1 = \Ar) 2 are the system 
state with second-maximum eigenvalue, on which both player 1 and player 2 get Ar. 
The vector form of \S rn ) is 

\S m ) = - (\N C N C ) + \N c N q ) + \N q N c ) + \N q N q )) = - (\N C ) + \N")) (\N C ) + \N q )) , 
or transfer it back into matrix form 
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So S m is not a unitary operator, although it leads to higher payoff, it might be unappli- 
cable. And even it's applicable, it's not a NE. Because, player i can get more payoff by 
adjust its own strategy. The role of such system state is that everyone knows it's not a 
best choice, but a good choice if both players can keep staying on such state, just like 
the |A^ C A^ C ) state in classical prisoner's dilemma. Leaving from such state will at least 
decrease the payoff of one player. Also state \Ap) = \Ap) 1 has such similar property. 

Now we discuss the reduced payoff matrix when player 2 or 1 choose U(02,(p2)- 
From eau(|26|). 








ei 













i€2 


ei 





ei 








-ie 2 





(2 



(27) 
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in which ei = ssin 2 A #2 + rcos^ 



psin 2 i# 2 + t cos 2 5^2- 



It has eigenvalues 



{2e2, 2ei, 0, 0}. When t > r > p > s, the 2e2 is the maximum eigenvalue for any 62- 
The corresponding eigenvector is (0, i, 0, or in matrix form, 





\/2i 



which is obviously not a unitary matrix. Although it leads to higher payoff, it's not 
applicable. The same situation happens to player 2, so the payoff value of the both 
players will be 4p. So non-unitary operators space gives a new NE, ^E 1 = 4p = E 2 ^. 

Now we limit out strategies in unitary operator space. For a general unitary operator 
strategy U [9\, 4>i), the payoff of player i is 



E' 



t cos — — - + p sin 



'(3-0 



sm - + 



2 V(3-i) . . 2 ^(3-1) 
r cos — 1 + s sm — - 



2 

cos — . 
2 



(28) 

Since the first term is larger, the best response is 6\ = ir. Similarly, when player 1 is 
fixed at U (#1, <p\). the best response of player 2 is U (ir, fa) = iF q . Therefor, the NE 
in quantum unitary strategy is 



\S) = \E 1 =p,E 2 



P 



(i\F*)i\F«)). 



(29) 



However, this NE strongly depends on equ©, because we have more unitary operators. 
In the whole unitary operator space defined by equ (|T2"|) . with the parameters a,f3,j, 
the payoff of player i is still in the form of eau(|28|). in which 0i is replaced with 74. So 
it's still independent of a, (3. Therefor, NE in this whole unitary operator space is 



\E 1 =p,E 2 =p) = U (7 = 7r,ai 5 p x ) ,U (7 = 7r,a 2 , fh) 
where in operator form, 

U (7 = 7T, a\ , (3i ) = —1 sm ■ 



a 



— b + 1 sm 1 y . 



(30) 



(31) 



So for our Quantum Prisoner's Dilemma, the payoff from Quantum Nash Equilib- 
rium is the same with the payoff from original Classical Nash Equilibrium. In this 
sense, we say the quantization does not solve the dilemma, although instead of only 
one NE in classical case, here the quantized game has more NEs (Va,/3). But if non- 
unitary operators are applicable, we will have better NEs, such as \E^ = Ap = E" 2 ) and 
\E 1 = 4r = E 2 ^. And further more, if entangled states are permitted, the former NEs 
will not be NEs anymore. In some cases, even GES can be foundj^]. Usually, such GES 
will not be a direct product state, so it includes correlation between players. This prop- 
erty looks like cooperative behavior. Although this game has no GES, it's still probably 
to find general NE as equ (|22|) in this entangled strategy space. Unfortunately, we have 
no applicable algorithm for such general NE. But if we can find them, it will probably 
have bigger payoff and also be unitary in the system space. Then, we can say, a general 
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NE in entangled strategy space solves the dilemma. However, of course, since it't not a 
direct product state, it implies that something like negotiation and agreement are the 
real reason to solve the dilemma. Anyway, even that, it's a good news, which means 
our representation is hopefully a way from non-cooperative game to cooperative game. 

Conclusion — Now we see the new representation can be applied onto the Quantum 
Prisoner's Dilemma. In classical strategy space, it gives the same result with traditional 
language, \E 1 = p = E 2 }; in quantum unitary operator strategy space, some new NEs, 
which have the same payoff with classical NE, appears, \E 1 = p = -E 2 ); in general 
quantum strategy space including non-unitary operators, two new NEs appear in such 
game, JS 1 = Ap = E 2 } and \E 1 = 4r = -E 2 ), but they are non-unitary operators; and at 
last, if entangled strategy is permitted, the game here has no GES, but still probably 
has NE. Unfortunately, now we have no way to get such NE. The existence of NE 
in such strategy space calls more investigation. In fact, the definition proposed here 
is for general NE in any strategy space, but without a proof of the existence and 
no applicable algorithm. And frankly, we even have no idea if such general NE is 
meaningful or not, because it requires non-direct-product state and/or non-unitary 
operator. However, the point is no matter whether they have applicable meaning or 
not, questions in Game Theory can be discussed in our new representation. Hopefully, 
one day, it will bring new stuff into Game Theory. And it should be able to prove 
that for all linear-probability-combination classical game and all linear-and-anti-linear- 
amplitude-combination quantum game, the new representation is always validly. 
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